METHOD AND APPARATUS FOR GENERATING 
RECOMMENDATIONS FOR CONSUMER PREFERENCE ITEMS 



Field of the Invention 
5 [01] This invention relates to consumer preference items, such as music, 

movies, fashions, books, television shows and other entertainment choices, and to 
methods and apparatus for receiving inputs from a user and generating 
recommendations for such items where the recommended items have a high probability 
that the user will like them. 

10 

Background of the Invention 
[02] In many areas that involve consumer preferences it is often difficult for the 
rg consumer to select items from a large variety of items available in order to create a 
preferred collection of items. This difficulty is often compounded where the number of 
15S available items is so large that it is not possible for the consumer to personally review 
/ each item in order to make a decision whether the item is preferred. For example, a 
y consumer may listen to music and enjoy certain songs. However, with the thousands of 
|^ songs that are available to any given consumer, it is generally not possible for that 
IP consumer to select preferred songs unless the song has been heard or the artist is 
2&* known, etc. Most consumers simply do not have time to listen to thousands of songs in 
order to form preference opinions. Further, in many cases, the user may have to buy 
the items, resulting in large expenditures in order to even attempt a selection. The 
same problem occurs with movies, television shows and other consumer preference 
items where a consumer forms a subjective preference, or liking, for individual items 
25 and wants recommendations to other similar items in order to review them. 

[03] Several prior art attempts have been made to solve this problem. One 
such prior art approach has been to categorize preference items and then, when a 
consumer indicates a preference for one item in such a category, other items in the 
same category are recommended to the consumer. Such an approach is common in 
30 on-line shopping services where the goods to be sold are categorized. When a shopper 
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buys an item in a category, such as a music CD, other CDs are recommended to the 
shopper, the next time the shopper logs on to the site. Alternative selections performed 
by the same artist or artists that composed the music that was purchased by the 
shopper may also be recommended. Suggestions may also be made from categories 
5 that contain preference items that have been previously selected by a "professional" or 
"expert" who has reviewed the items and placed them into categories. These prior art 
systems can make recommendations that are at least within the general area that is of 
interest to the consumer. However, the categories are generally broad and, thus, the 
recommendations are usually only peripherally related to the consumers actual 
10 preferences. 

^ [04] Similar systems can be used to recommend songs. For example, a 

Si consumer may be asked questions in order to determine musical preferences for 
m selected musical "genres", such as popular, jazz, classical, etc or "moods." Once a 
W genre has been selected, the system will select a short list of songs from song 
isy collections or albums that have been previously classified as with the selected genre by 
a music professional or expert as discussed previously. Such a system is available 
\U from Mubu.com or Savagebeast.com, for example. Still other systems, such as 
jp Moodlogic.com, allow other consumers to log onto a website and classify the songs. 
If [05] Other prior art solutions use a known database search engine to perform a 

20 search, such as a word or text search to locate preference items. The results are then 
refined based on the "popularity" of the items discovered so that the relative ranking of 
the located items that are more popular are varied depending on the type of search. 
Such a system is disclosed in U.S. Patent No. 6,006,218. 

[06] Still other solutions use varying forms of digital signal analysis to evaluate 
25 preference items, such as songs. In this approach, sample songs that have been 

indicated as preferred by a customer are analyzed to determine characteristics, such as 
beats per minute and selected beat patterns. The characteristics are then compared to 
a database of characteristics generated from a large collection of songs. Songs in the 
database with statistically similar characteristics are grouped with the sample songs and 
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recommended to the consumer. Examples of systems that operate in this manner are 
provided by Mongomusic.com, Gigabeat.com, Savagebeast.com and Cantametrix.com. 

[07] While the aforementioned systems do generate recommendations, they 
are relatively crude and inaccurate and are capable of generating only a limited number 
5 of recommendations. Therefore, there is a need for a recommendation system that can 
generate substantial numbers of recommended items that accurately reflect a 
consumer's preferences. 

Summary of the Invention 
10 [08] In accordance with the principles of the invention, one illustrative 

S3 embodiment uses a database of consumer preference items, such as songs, movies or 
\1 television shows to generate the recommendations. The database is created from 
}t consumer preference tests in which a large number of respondents comparatively rate a 
W large number of items. The database contains calculated distances between each pair 
15-J of items based on the respondent preference ratings. 

[09] In order to make recommendations from the database, a profile procedure 
iU based on inputs from a customer generates profile sample items that the customer 
i;;fi prefers. These profile sample items are then applied as inputs to the database and 
2 items in the database within a predetermined distance from the profile sample items are 
20 recommended to the customer. 

[10] In one embodiment, the distance used to determine the recommendations 
from the database is fixed and the number of items recommended can be changed by 
varying the distance, and, in another embodiment, the distance can be modified by the 
customer. 

25 [1 1] In still another embodiment, the recommended items are displayed to the 

customer as feedback from the system and the customer can then change the profile 
sample items to refine or expand the recommendations. 

[12] In yet another embodiment, the customer interacts with a local terminal, 
which performs the profile procedure, and the database is contained in a remote server 

30 that may be connected to the local terminal by a network, such as the Internet. 
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Brief Description of the Drawings 
[13] The above and further advantages of the invention may be better 
understood by referring to the following description in conjunction with the 
5 accompanying drawings in which: 

[14] Figure 1 is a block schematic diagram showing an illustrative computer 
system on which the inventive recommendation system can run. 

[15] Figure 2 is a block schematic diagram showing an overall view of one 
embodiment of the inventive recommendation system. 
10 [16] Figure 3 is a flowchart showing the steps in a recommendation process 

Q that operates in accordance with Figure 2. 

SI [17] Figure 4 is a flowchart showing the steps in an illustrative process for 

generating a customer profile. 
® [18] Figure 5 is a flowchart showing the steps in an illustrative process for 

1& 5 1 generating database entries. 



V Detailed Description 

ill [19] Figure 1 illustrates, in schematic form, a computer system suitable for 

^ implementing the inventive preference item recommendation system. In the system 

20 shown in Figure 1 , local terminals, of which terminals 100 and 104 are shown, accept 
input from customers and display recommendations, and are located in areas that are 
convenient to customers. For example, these terminals may be located in a customer's 
home, in or near retail outlets that sell items for which recommendations are generated, 
in kiosks, etc. Terminals 100 and 104 may be personal computer systems, display 

25 terminals, wireless apparatus or other display mechanisms. As will be hereinafter 
described, a customer wishing to use the inventive music recommendation system, 
enters sample preference information into a local terminal, such as terminal 100. 
Terminal 1 00 then generates a customer "profile" for that particular customer. The 
customer profile is forwarded over a network, such as the Internet, to a server 106 that 

30 may be located remotely. 
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[20] The server 106 compares the customer profile generated at the terminal 
100 to a database of preference item information 108 and identifies items that are 
similar to the preference items specified in the customer's profile as indicated by pre- 
calculated distance values in the database 108. The identified items are used as 
5 recommendations. When the recommendations have been obtained, server 106 
forwards them back to local terminal 100, again via the network 102, for display to the 
customer. The customer then may accept the recommendations or may revise the 
sample preferences entered into the system in order to change the customer profile and 
generate new recommendations. For example, the customer may use the 
10 recommendations as new sample preferences information to create a new, more 
Q fbcussed profile. If the profile is changed, the new profile is sent, via network 102 to 
%i server 106 and again compared to the preference item information stored in database 
jfj 108 and the results returned. Calculations of the distance values in database 108 are 
W based on the results of a consumer preference study conducted with other consumers, 
15a rather than professionals or experts, as discussed in detail below. 
% [21] The computer system shown in Figure 1 is illustrative and other 

ffj configurations that differ both architecturally and operationally can also be used with the 
i;p present invention without departing from the spirit and scope of the invention. For 
J J example, the customer profile generator that runs in local terminal 100 and the server 
20 program that runs in server 106 may, in fact, run in the same computer so that the entire 
system operates on a single computer that might be located in a kiosk, for example. In 
addition, a LAN, WAN or other network may be used in place of the Internet 102 as 
shown in Figure 1 . Further, the customer profile generator could also be located in a 
website accessed by the customer over the Internet. 
25 [22] Figure 2 illustrates, in a schematic form, a recommendation process that 

operates in accordance with the principles of the present invention. The process begins 
when a customer enters information into a terminal, such as terminal 100, shown in 
Figure 1 . The purpose of this information is to develop sample preference items that 
represent a customer's preferences in a given area, such as music. Those skilled in the 
30 art would know that these sample preferences could be ascertained in a number of 
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ways. One method to obtain these sample items would be to elicit from the customer 
the names of some items that the customer likes. However, in many cases the 
customer may be able to identify some items, but not enough items to form a basis for 
making recommendations. Consequently, in one embodiment of the invention, the 
5 customer is prompted to respond to displayed choices. In this manner, the customer 
will be guided to selecting enough sample preference items so that accurate 
recommendations can be generated. In general, the displayed choices are arranged to 
reduce or filter the choices so that the profiling process generates enough sample 
preference items to make accurate recommendations, but the customer does not have 
10 to take a complete customer preference test. In particular, in one embodiment, the 
% customer enters information in response to choices displayed at the terminal by a 
2 category filter 202. The displayed choices structure the information entered by the 
;jl customer and reduce the amount of information that must be entered in order to simplify 
| the generation of the customer profile. The choices made by the customer enable a 
15 - profile to be generated for this particular customer. 

'i'i 

Q [23] The selections presented to the customer may act as a filter or screening 

; 2 device to quickly reduce the possible number of choices and make the information entry 
5 faster. For example, the first choices displayed can be a plurality of broad item 
|!2 categories that define potential areas of interest to the customer. Categories that are 
20 not selected by the customer allow the profile generator to eliminate classes of items 
that are of no interest to the customer. In order to ensure complete coverage of all 
possible items, the category choices are broad format descriptors that represent all of 
the items in the database 214. For example, in the case of a music recommendation 
system, the category choices might be music styles, such as 1) new popular; 2) old 
25 popular; 3) new rock; 4) old rock; 5) country; 6) smooth jazz; 7) oldies; 8) hip hop; and 
9) rhythm and blues. The aforementioned categories are for purposes of illustration 
only; different categories could be used that would be known to be equivalent by those 
skilled in the art. 

[24] The displayed categories may also include additional information that will 
30 indicate the types of items in the category and assist the customer in deciding whether 

6 



to select that category. For example, in the case of a music system, each category, or 
music style, may have a list of artists who have recorded songs in that category 
displayed along with the category name so that the consumer can associate brand 
names with the category name. 
5 [25] In response to the category display, the customer may select one or more 

categories that are of interest to him. The customer category selections are indicated 
schematically in Figure 2 as arrow 200 and are provided to the category filter 202. The 
category filter 202 provides the category selections as indicated by arrow 204 to a 
sample profile item generator 206 that further refines the customer profile by generating 
10 and displaying a plurality of profile sample items for each selected category. Each 
p profile sample item consists of information identifying a preference item that represents 
*t\ a subset or a substyle of each selected category. In the case of a music 
0| recommendation system, the profile sample items can be representative songs from 
vg several substyles in each music category. For example, if the customer selected the 
15; J "new popular" music category referenced above, the following songs and artists might 
be displayed: 





[26] 




Artist 


Title 


Rating 




127] 


1. 


Brittany Spears 


Oops! 


1 2345 


■:.iSS2 

20 


[28] 


2. 


N'Sync 


Bye Bye Bye 


1 2345 




[29] 


3. 


Sugar Ray 


Every Morning 


1 2345 




[30] 


4. 


Brian McNight 


Back to One 


1 2345 




[31] 


5. 


Pink 


There You Go 


1 2345 




[32] 


6. 


Vertical Horizon 


Everything 


1 2345 


25 


[33] 


7. 


Santana 


Smooth 


1 2345 



[34] Again, the aforementioned items are for purposes of illustration only and 
other arrangements within the skill of the art could be used. In general, a small number, 
for example 2-3 items, will be displayed for each distinct substyle represented in the 
30 category, although more or less items could be used. The customer is then asked to 
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rate each of the displayed profile sample items with a predetermined rating scale (1-5 in 
the example given above.) The system may assist in this rating by allowing the 
customer to hear or see short excerpts of the preference item. The customer's ratings 
are schematically indicated by arrow 208 and allow the system to judge the customer's 
5 substyle preference. 

[35] When the profile sample items in all of the selected categories have been 
rated by the customer, the ratings information indicated by arrow 212 in Figure 2 is 
applied to an item thresholding operation as indicated by box 214. In particular, the 
number of profile sample items selected by the generator 206 is reduced by discarding 
10 all of those items where the customer's rating falls below a predetermined threshold. 
O For example, in the aforementioned music recommendation system, the thresholding 
Cj operation 214 may discard all profile sample items having a rating of less than 4 so that 

all remaining sample profile items have customer ratings of 4 and 5. Alternatively, the 
ffl thresholding operation may use low scores to assist in the creation of a final profile by 
15\j weighting each song by its score or a number derived from the score in order to arrive 
at an adjusted score or preference. Other alternative arrangements would be obvious 
Hi to those skilled in the art. 

gn [36] Information identifying the profile items selected by means of the 

^: customer's input, schematically illustrated as arrows 216, is then provided to the 

20 recommendation database 220. As previously mentioned, the database 220 contains 
information identifying a large number of consumer preference items arranged and a 
difference table that contains calculated differences between pairs of the items such 
that the table holds difference values that represent the differences between a given 
item and all other items in the database. In accordance with the principles of the 

25 invention, the calculated differences are determined from ratings obtained in a 

consumer preference test conducted with other consumers. The preference test may 
be conducted before a live audience comprising a plurality of consumers who take the 
test together or the consumers may take the test individually at different times, for 
example, by logging onto a specialized website. In any case, a statistically significant 

30 number of consumers should take the test. The database is compiled so that it contains 
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information on all of the preference items that can be identified in customer profiles and, 
of course, many more additional items that will form the basis for the recommendations. 

[37] More specifically, the consumer preference test 210 may consist of a 
single test or a plurality of tests. In one embodiment, in each test, a consumer audience 
5 comprising for example 50-100 respondents is asked to rate a set of consumer 
preference items on a predetermined rating scale. For example, each of 100 
respondents may be asked to rate 500 songs by listening to each song and rating the 
song as to whether they like the song, they are neutral about the song or they do not 
like the song. All of the ratings information is then used to generate the difference table 
10 using a conventional multi-variable analysis operation 222. In general, the database 
information would be periodically compiled in order to add new items. The frequency of 
such compilation would depend on the frequency at which new preference items are 
ifl introduced. For example, in a music recommendation system, database 220 might be 
recompiled every six months in order to add new songs to the database and to adjust 
15 % I preference scores or add new preference scores. 
q [38] In accordance with the principles of the invention, the ratings used during 

\ u s the consumer preference test measure each respondent's preference for a particular 
ffl song, that is, whether the respondent subjectively "likes" or "dislikes" the song. This 
J preference rating is in contrast to prior rating systems which ask respondents to 
20 categorize each song by music category, such as jazz, pop, etc or by some other 

category such as "mood" (romantic, bouncy, etc.) Preference ratings have been found 
to give recommendations that are more accurate because a particular customer may 
still "like" two songs even if they are in different categories. More particularly, it has 
been found that, if many consumers like both of two songs, there is a substantial 
25 probability that another customer who likes one of the songs will also like the other 
song. 

[39] In the analysis operation 222, for each pair of consumer preference items, 
the distance between the items, measured as the difference in the ratings, is calculated 
for every respondent in the consumer preference test. The square of each difference is 
30 then summed. This distance becomes the Euclidean distance squared in N- 
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dimensional space where N is the number of valid respondents. The distance used in 
the distance table may then be taken as the square root of the results, which is the 
Euclidean distance or some other measure such as the Euclidean distance squared. 
Those skilled in the art would realize that other distance measures, such as Chi-square, 
5 variance, Bayesian and other known distance measures, or combinations thereof, could 
be used in place of, or in addition to, the Euclidian distance measure discussed above 
to arrive at a final "distance" measure. An arbitrary scale may be used for the ratings. 
For example, negative opinions, which means the respondent dislikes the preference 
item, may be rated at minus 1; no opinion at 0; preferred opinions at 1 and favorites at 
10 1 .5. Conventional analysis software can be used to generate the difference table. For 
;^ example, software, which is suitable for performing the above analysis, is marketed 
M under the name "Variety Control" by Steve Casey Research, 663 Washington Avenue, 
{5 Santa Fe, New Mexico 87501 . After the computations are completed, a table, which 
v * identifies each pair of songs and specifies the distance measurement between each 
1 5 -4 pair of songs, is stored in database 220. 
q [40] Then the profile sample item information, which is generated by the item 

;*f thresholding step 214 as indicated by arrows 216, is applied to the database 220 by a 
iji recommendation unit 224 that matches information identifying each profile sample item 
2 with information identifying a corresponding item in the database and then selects other 
20 preference items in the database where the distance from the profile sample item to the 
other items is less than or equal to a predetermined distance. This process is repeated 
for each profile sample item in order to produce a collection of recommended 
preference items that are indicated by arrow 226 in Figure 2. For example, in a music 
recommendation system, song titles and artists produced by the profile generation 
25 process are used to select songs located within a predetermined distance in the 
database. The titles and artists of these selected songs are then returned as 
recommendations. These recommendations may be then displayed as indicated in the 
box 228, Information identifying the recommended items may also be returned to the 
customer, as indicated by schematically by arrow 218, to modify or replace the profile 
30 sample items displayed for each category and, therefore, to refine the search. 
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[41] While it might initially appear that, during a consumer preference test, 
each respondent's ratings of the items may have no relation to another respondent's 
ratings, it has been found that many items in the test, in fact, do belong together in the 
sense that they are liked and disliked by substantially the same test respondents. Thus, 
as the distance between two preference items decreases, it is likely that a person, such 
as the customer who is requesting recommendations, who indicates a preference for 
one item will also prefer the other item. Consequently, the inventive method generates 
accurate results in that the recommendations produced are generally preferred by the 
customer. 

[42] Figure 3 is a flowchart that gives an overview of the inventive 
recommendation process. The process starts in step 300 and proceeds to step 302 
where information identifying sample profile items is obtained from a customer by 
means of the profiling process described in connection with Figure 2, blocks 202, 206 
and 214. The sample profile item information is then applied to the difference table in 
the database 220 as indicated in step 304. In step 306, recommended preference items 
in the database 220 are selected by choosing items within a selected distance from the 
sample profile items. 

[43] Next, in step 308, information identifying the recommended items is 
displayed to the user. In step 310, the customer makes a determination whether the 
recommended items are acceptable. If so, the process finishes in step 312. If not, the 
process returns to step 302 where, for example, the recommended items may be used 
to modify the subcategory choices displayed during the profile generation process or 
may be displayed as the subcategory choices. A customer may then rate these new 
choices to obtain new sample profile items in step 302. Steps 304-310 are then 
repeated until acceptable recommendation items are obtained. 

[44] Figure 4 is a flowchart that shows, in more detail, a process for generating 
profile items as described above in connection with Figure 2. In particular, the process 
starts in step 400 and proceeds to step 402 where profile categories are displayed to 
the user, for example on the local terminal 100 or by another means. 
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[45] In step 404, category selections are made by, and received from, the 
customer, for example, by using a keyboard, mouse or other selection device. 

[46] Next in step 406, profile items corresponding to subclasses of each 
category are displayed and, in step 408, ratings of each of the displayed profile items 
5 are received from the user, again by means of a keyboard, mouse or other selection 
device. 

[47] In step 410, a thresholding process is used to select profile items with 
ratings greater than a predetermined threshold value. In step 412, the selected profile 
items are displayed to the customer to allow the customer to confirm his choice. In step 
10 414, the customer makes a determination whether the selected profile items are 

□ acceptable, if so, the process finishes in step 41 6. If not, the process returns back to 
ifl step 406 in which the profile items in the selected categories are redisplayed to allow 
iu the user to re-rate the items in order to refine the profile. Steps 408-414 are then 

repeated until acceptable profile items are obtained and the process finishes in step 
15^ 416. 

[48] Figure 5 is a flowchart that illustrates, in more detail, the creation of the 
distance table in the recommendation database 220 in accordance with the principles of 
the present invention. This process starts in step 500 and proceeds to step 502 where 

□ a consumer preference test is conducted on a plurality of consumers. The consumers 
20 s may consist of paid or unpaid respondents. For example, a preference test may consist 

of 100 respondents. In step 502, representative preference items are presented to the 
test respondents. For example, the respondents may be asked to rate 500 songs each. 
A typical manner of performing such a test is to play the songs, or portions of the songs, 
in an auditorium. Alternatively, the songs may be played for each consumer individually 

25 if the consumers take the test individually. Each respondent listens to the song and 
then rates the song. Such a test is called an "auditorium test". Similar tests can be 
used for movies, television shows or other consumer preference items. 

[49] Next in step 504, ratings of each of the survey items are obtained from 
each of the test respondents. In general, such ratings may consist of a numerical 

30 rating, a rating scale or a like/don't like rating. Next, in step 506, the distance between 
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each pair of preference items is calculated for each test respondent. As previously 
mentioned, this distance can be simply calculated by subtracting the difference between 
the rating scores for each pair of respondents. 

[50] Next, as indicated in step 508, the distances for each pair of preference 
items are combined, for example, by squaring and summing the distances and then 
possibly scaling the distances, for example, by adjusting the differences to fit on a 
predetermined scale. Next, in step 510, information identifying each preference item 
and the scaled distances are stored in the database table. The routine then finishes in 
step 512, 

[51] A software implementation of the above-described embodiment may 
comprise a series of computer instructions either fixed on a tangible medium, such as a 
computer readable medium, e.g. a diskette, a CD-ROM, a ROM memory, or a fixed 
disk, or transmissible to a computer system, via a modem or other interface device over 
a medium. The medium either can be a tangible medium, including, but not limited to, 
optical or analog communications lines, or may be implemented with wireless 
techniques, including but not limited to microwave, infrared or other transmission 
techniques. It may also be the Internet. The series of computer instructions embodies 
all or part of the functionality previously described herein with respect to the invention. 
Those skilled in the art will appreciate that such computer instructions can be written in 
a number of programming languages for use with many computer architectures or 
operating systems. Further, such instructions may be stored using any memory 
technology, present or future, including, but not limited to, semiconductor, magnetic, 
optical or other memory devices, or transmitted using any communications technology, 
present or future, including but not limited to optical, infrared, microwave, or other 
transmission technologies. It is contemplated that such a computer program product 
may be distributed as removable media with accompanying printed or electronic 
documentation, e.g., shrink wrapped software, pre-loaded with a computer system, e.g., 
on system ROM or fixed disk, or distributed from a server or electronic bulletin board 
over a network, e.g., the Internet or World Wide Web or cellular links. 
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[52] Although an exemplary embodiment of the invention has been disclosed, it 
will be apparent to those skilled in the art that various changes and modifications can be 
made which will achieve some of the advantages of the invention without departing from 
the spirit and scope of the invention. For example, it will be obvious to those reasonably 
skilled in the art that, although the description was directed to particular preference 
items, such as songs, movies or television shows, that almost any item for which a 
customer can form a subjective like or dislike is amenable to the inventive 
recommendation process. Other aspects, such as the specific instructions utilized to 
achieve a particular function, as well as other modifications to particular processes or 
routines used to achieve a function are intended to be covered by the appended claims. 

[53] What is claimed is: 
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